Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 100 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 11.1 KiB |
| Average record size in memory | 113.3 B |
Variable types
| Categorical | 3 |
|---|---|
| Numeric | 11 |
title has a high cardinality: 100 distinct values | High cardinality |
artist has a high cardinality: 64 distinct values | High cardinality |
energy is highly correlated with loudness.dB and 1 other fields | High correlation |
loudness.dB is highly correlated with energy | High correlation |
acousticness is highly correlated with energy | High correlation |
year is highly correlated with length | High correlation |
energy is highly correlated with loudness.dB and 1 other fields | High correlation |
loudness.dB is highly correlated with energy and 1 other fields | High correlation |
length is highly correlated with year | High correlation |
acousticness is highly correlated with energy and 1 other fields | High correlation |
top genre is highly correlated with artist and 1 other fields | High correlation |
artist is highly correlated with top genre and 1 other fields | High correlation |
title is highly correlated with top genre and 1 other fields | High correlation |
title is highly correlated with artist and 12 other fields | High correlation |
artist is highly correlated with title and 10 other fields | High correlation |
top genre is highly correlated with title and 8 other fields | High correlation |
year is highly correlated with title and 2 other fields | High correlation |
beats.per.minute is highly correlated with title and 3 other fields | High correlation |
energy is highly correlated with title and 5 other fields | High correlation |
danceability is highly correlated with title and 4 other fields | High correlation |
loudness.dB is highly correlated with title and 3 other fields | High correlation |
liveness is highly correlated with title and 2 other fields | High correlation |
valance is highly correlated with title and 1 other fields | High correlation |
length is highly correlated with title and 2 other fields | High correlation |
acousticness is highly correlated with title and 3 other fields | High correlation |
speechiness is highly correlated with title and 4 other fields | High correlation |
popularity is highly correlated with title and 1 other fields | High correlation |
title is uniformly distributed | Uniform |
title has unique values | Unique |
acousticness has 8 (8.0%) zeros | Zeros |
Reproduction
| Analysis started | 2021-10-07 17:09:28.137389 |
|---|---|
| Analysis finished | 2021-10-07 17:10:08.982441 |
| Duration | 40.85 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 100 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 928.0 B |
| Blinding Lights | 1 |
|---|---|
| One Kiss (with Dua Lipa) | 1 |
| 7 Years | 1 |
| Don't Let Me Down | 1 |
| Sorry | 1 |
| Other values (95) |
Length
| Max length | 62 |
|---|---|
| Median length | 13 |
| Mean length | 16.01 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 100 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Blinding Lights |
|---|---|
| 2nd row | Watermelon Sugar |
| 3rd row | Mood (feat. iann dior) |
| 4th row | Someone You Loved |
| 5th row | Perfect |
Common Values
| Value | Count | Frequency (%) |
| Blinding Lights | 1 | 1.0% |
| One Kiss (with Dua Lipa) | 1 | 1.0% |
| 7 Years | 1 | 1.0% |
| Don't Let Me Down | 1 | 1.0% |
| Sorry | 1 | 1.0% |
| New Rules | 1 | 1.0% |
| Attention | 1 | 1.0% |
| I'm Yours | 1 | 1.0% |
| Old Town Road - Remix | 1 | 1.0% |
| Youngblood | 1 | 1.0% |
| Other values (90) | 90 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 11 | 3.8% | |
| feat | 10 | 3.4% |
| you | 8 | 2.8% |
| me | 8 | 2.8% |
| with | 6 | 2.1% |
| i | 6 | 2.1% |
| the | 5 | 1.7% |
| like | 5 | 1.7% |
| remix | 5 | 1.7% |
| don't | 4 | 1.4% |
| Other values (198) | 222 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 64 |
|---|---|
| Distinct (%) | 64.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 928.0 B |
| Post Malone | 7 |
|---|---|
| Ed Sheeran | 5 |
| The Weeknd | 4 |
| Imagine Dragons | 4 |
| Shawn Mendes | 3 |
| Other values (59) |
Length
| Max length | 23 |
|---|---|
| Median length | 11 |
| Mean length | 10.85 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 45 ? |
|---|---|
| Unique (%) | 45.0% |
Sample
| 1st row | The Weeknd |
|---|---|
| 2nd row | Harry Styles |
| 3rd row | 24kGoldn |
| 4th row | Lewis Capaldi |
| 5th row | Ed Sheeran |
Common Values
| Value | Count | Frequency (%) |
| Post Malone | 7 | 7.0% |
| Ed Sheeran | 5 | 5.0% |
| The Weeknd | 4 | 4.0% |
| Imagine Dragons | 4 | 4.0% |
| Shawn Mendes | 3 | 3.0% |
| Billie Eilish | 3 | 3.0% |
| Maroon 5 | 3 | 3.0% |
| The Chainsmokers | 3 | 3.0% |
| Justin Bieber | 3 | 3.0% |
| Travis Scott | 2 | 2.0% |
| Other values (54) | 63 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| the | 8 | 4.2% |
| post | 7 | 3.7% |
| malone | 7 | 3.7% |
| sheeran | 5 | 2.6% |
| ed | 5 | 2.6% |
| weeknd | 4 | 2.1% |
| 5 | 4 | 2.1% |
| imagine | 4 | 2.1% |
| justin | 4 | 2.1% |
| dragons | 4 | 2.1% |
| Other values (103) | 139 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 34 |
|---|---|
| Distinct (%) | 34.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 928.0 B |
| dance pop | |
|---|---|
| pop | |
| dfw rap | |
| modern rock | |
| canadian pop | |
| Other values (29) |
Length
| Max length | 25 |
|---|---|
| Median length | 9 |
| Mean length | 9.87 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | 21.0% |
Sample
| 1st row | canadian contemporary r&b |
|---|---|
| 2nd row | pop |
| 3rd row | cali rap |
| 4th row | pop |
| 5th row | pop |
Common Values
| Value | Count | Frequency (%) |
| dance pop | 28 | |
| pop | 11 | 11.0% |
| dfw rap | 7 | 7.0% |
| modern rock | 6 | 6.0% |
| canadian pop | 6 | 6.0% |
| canadian contemporary r&b | 4 | 4.0% |
| electropop | 4 | 4.0% |
| melodic rap | 3 | 3.0% |
| latin | 2 | 2.0% |
| folk-pop | 2 | 2.0% |
| Other values (24) | 27 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| pop | 50 | |
| dance | 30 | |
| rap | 18 | 9.6% |
| canadian | 12 | 6.4% |
| rock | 8 | 4.3% |
| dfw | 7 | 3.7% |
| hop | 6 | 3.2% |
| hip | 6 | 3.2% |
| modern | 6 | 3.2% |
| r&b | 4 | 2.1% |
| Other values (29) | 41 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 14 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2015.96 |
| Minimum | 1975 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 1975 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2015 |
| median | 2017 |
| Q3 | 2018 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 46 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 5.327496806 |
|---|---|
| Coefficient of variation (CV) | 0.002642659977 |
| Kurtosis | 37.37937618 |
| Mean | 2015.96 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -5.40386338 |
| Sum | 201596 |
| Variance | 28.38222222 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=14)
| Value | Count | Frequency (%) |
| 2018 | 20 | |
| 2019 | 16 | |
| 2016 | 16 | |
| 2015 | 13 | |
| 2017 | 12 | |
| 2014 | 6 | 6.0% |
| 2013 | 4 | 4.0% |
| 2020 | 3 | 3.0% |
| 2021 | 3 | 3.0% |
| 2012 | 3 | 3.0% |
| Other values (4) | 4 | 4.0% |
| Value | Count | Frequency (%) |
| 1975 | 1 | 1.0% |
| 1995 | 1 | 1.0% |
| 2004 | 1 | 1.0% |
| 2008 | 1 | 1.0% |
| 2012 | 3 | 3.0% |
| 2013 | 4 | 4.0% |
| 2014 | 6 | 6.0% |
| 2015 | 13 | |
| 2016 | 16 | |
| 2017 | 12 |
| Value | Count | Frequency (%) |
| 2021 | 3 | 3.0% |
| 2020 | 3 | 3.0% |
| 2019 | 16 | |
| 2018 | 20 | |
| 2017 | 12 | |
| 2016 | 16 | |
| 2015 | 13 | |
| 2014 | 6 | 6.0% |
| 2013 | 4 | 4.0% |
| 2012 | 3 | 3.0% |
| Distinct | 56 |
|---|---|
| Distinct (%) | 56.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 116.97 |
| Minimum | 71 |
|---|---|
| Maximum | 186 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 71 |
|---|---|
| 5-th percentile | 79.95 |
| Q1 | 95 |
| median | 115 |
| Q3 | 135.25 |
| 95-th percentile | 171 |
| Maximum | 186 |
| Range | 115 |
| Interquartile range (IQR) | 40.25 |
Descriptive statistics
| Standard deviation | 27.47062894 |
|---|---|
| Coefficient of variation (CV) | 0.2348519188 |
| Kurtosis | -0.4028637027 |
| Mean | 116.97 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.5907689298 |
| Sum | 11697 |
| Variance | 754.6354545 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 90 | 6 | 6.0% |
| 100 | 6 | 6.0% |
| 120 | 5 | 5.0% |
| 136 | 4 | 4.0% |
| 95 | 3 | 3.0% |
| 124 | 3 | 3.0% |
| 98 | 3 | 3.0% |
| 102 | 3 | 3.0% |
| 125 | 3 | 3.0% |
| 108 | 2 | 2.0% |
| Other values (46) | 62 |
| Value | Count | Frequency (%) |
| 71 | 1 | |
| 75 | 1 | |
| 76 | 1 | |
| 77 | 1 | |
| 79 | 1 | |
| 80 | 1 | |
| 83 | 2 | |
| 84 | 2 | |
| 85 | 1 | |
| 89 | 1 |
| Value | Count | Frequency (%) |
| 186 | 1 | |
| 178 | 2 | |
| 174 | 1 | |
| 171 | 2 | |
| 170 | 1 | |
| 168 | 1 | |
| 160 | 2 | |
| 155 | 2 | |
| 151 | 1 | |
| 150 | 2 |
| Distinct | 50 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62.68 |
| Minimum | 11 |
|---|---|
| Maximum | 92 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 36.8 |
| Q1 | 52 |
| median | 64.5 |
| Q3 | 76 |
| 95-th percentile | 85.05 |
| Maximum | 92 |
| Range | 81 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 16.49173653 |
|---|---|
| Coefficient of variation (CV) | 0.2631100276 |
| Kurtosis | -0.1864652981 |
| Mean | 62.68 |
| Median Absolute Deviation (MAD) | 12.5 |
| Skewness | -0.5067749133 |
| Sum | 6268 |
| Variance | 271.9773737 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 73 | 6 | 6.0% |
| 56 | 5 | 5.0% |
| 80 | 4 | 4.0% |
| 54 | 4 | 4.0% |
| 61 | 4 | 4.0% |
| 52 | 3 | 3.0% |
| 82 | 3 | 3.0% |
| 59 | 3 | 3.0% |
| 79 | 3 | 3.0% |
| 78 | 3 | 3.0% |
| Other values (40) | 62 |
| Value | Count | Frequency (%) |
| 11 | 1 | |
| 26 | 1 | |
| 30 | 1 | |
| 32 | 1 | |
| 33 | 1 | |
| 37 | 1 | |
| 38 | 2 | |
| 39 | 2 | |
| 40 | 2 | |
| 41 | 1 |
| Value | Count | Frequency (%) |
| 92 | 1 | 1.0% |
| 91 | 1 | 1.0% |
| 90 | 1 | 1.0% |
| 87 | 1 | 1.0% |
| 86 | 1 | 1.0% |
| 85 | 1 | 1.0% |
| 83 | 2 | |
| 82 | 3 | |
| 81 | 1 | 1.0% |
| 80 | 4 |
| Distinct | 46 |
|---|---|
| Distinct (%) | 46.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.96 |
| Minimum | 35 |
|---|---|
| Maximum | 91 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 35 |
|---|---|
| 5-th percentile | 40.85 |
| Q1 | 59 |
| median | 69 |
| Q3 | 77 |
| 95-th percentile | 85.05 |
| Maximum | 91 |
| Range | 56 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 13.6040101 |
|---|---|
| Coefficient of variation (CV) | 0.2031662202 |
| Kurtosis | -0.2485413235 |
| Mean | 66.96 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.623671297 |
| Sum | 6696 |
| Variance | 185.0690909 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=46)
| Value | Count | Frequency (%) |
| 75 | 7 | 7.0% |
| 78 | 6 | 6.0% |
| 73 | 5 | 5.0% |
| 69 | 4 | 4.0% |
| 61 | 4 | 4.0% |
| 51 | 3 | 3.0% |
| 79 | 3 | 3.0% |
| 77 | 3 | 3.0% |
| 59 | 3 | 3.0% |
| 66 | 3 | 3.0% |
| Other values (36) | 59 |
| Value | Count | Frequency (%) |
| 35 | 2 | |
| 36 | 1 | |
| 37 | 1 | |
| 38 | 1 | |
| 41 | 1 | |
| 42 | 2 | |
| 44 | 1 | |
| 45 | 2 | |
| 48 | 1 | |
| 50 | 1 |
| Value | Count | Frequency (%) |
| 91 | 1 | 1.0% |
| 90 | 1 | 1.0% |
| 88 | 1 | 1.0% |
| 87 | 1 | 1.0% |
| 86 | 1 | 1.0% |
| 85 | 3 | |
| 84 | 1 | 1.0% |
| 83 | 2 | |
| 82 | 2 | |
| 80 | 1 | 1.0% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -6.1 |
| Minimum | -14 |
|---|---|
| Maximum | -3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 100 |
| Negative (%) | 100.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | -14 |
|---|---|
| 5-th percentile | -10 |
| Q1 | -7 |
| median | -6 |
| Q3 | -5 |
| 95-th percentile | -3 |
| Maximum | -3 |
| Range | 11 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.987333628 |
|---|---|
| Coefficient of variation (CV) | -0.3257923981 |
| Kurtosis | 1.889514027 |
| Mean | -6.1 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.9935568513 |
| Sum | -610 |
| Variance | 3.949494949 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) |
| -6 | 25 | |
| -5 | 20 | |
| -7 | 16 | |
| -4 | 13 | |
| -8 | 9 | 9.0% |
| -3 | 7 | 7.0% |
| -10 | 4 | 4.0% |
| -9 | 3 | 3.0% |
| -11 | 2 | 2.0% |
| -14 | 1 | 1.0% |
| Value | Count | Frequency (%) |
| -14 | 1 | 1.0% |
| -11 | 2 | 2.0% |
| -10 | 4 | 4.0% |
| -9 | 3 | 3.0% |
| -8 | 9 | 9.0% |
| -7 | 16 | |
| -6 | 25 | |
| -5 | 20 | |
| -4 | 13 | |
| -3 | 7 | 7.0% |
| Value | Count | Frequency (%) |
| -3 | 7 | 7.0% |
| -4 | 13 | |
| -5 | 20 | |
| -6 | 25 | |
| -7 | 16 | |
| -8 | 9 | 9.0% |
| -9 | 3 | 3.0% |
| -10 | 4 | 4.0% |
| -11 | 2 | 2.0% |
| -14 | 1 | 1.0% |
| Distinct | 32 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.86 |
| Minimum | 3 |
|---|---|
| Maximum | 79 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 10 |
| median | 12 |
| Q3 | 17.25 |
| 95-th percentile | 39.05 |
| Maximum | 79 |
| Range | 76 |
| Interquartile range (IQR) | 7.25 |
Descriptive statistics
| Standard deviation | 12.97240272 |
|---|---|
| Coefficient of variation (CV) | 0.7694189039 |
| Kurtosis | 7.248451807 |
| Mean | 16.86 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.492425144 |
| Sum | 1686 |
| Variance | 168.2832323 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=32)
| Value | Count | Frequency (%) |
| 9 | 13 | |
| 11 | 13 | |
| 10 | 11 | 11.0% |
| 12 | 6 | 6.0% |
| 13 | 5 | 5.0% |
| 14 | 5 | 5.0% |
| 15 | 5 | 5.0% |
| 8 | 4 | 4.0% |
| 16 | 4 | 4.0% |
| 7 | 3 | 3.0% |
| Other values (22) | 31 |
| Value | Count | Frequency (%) |
| 3 | 1 | 1.0% |
| 5 | 1 | 1.0% |
| 6 | 2 | 2.0% |
| 7 | 3 | 3.0% |
| 8 | 4 | 4.0% |
| 9 | 13 | |
| 10 | 11 | |
| 11 | 13 | |
| 12 | 6 | |
| 13 | 5 | 5.0% |
| Value | Count | Frequency (%) |
| 79 | 1 | |
| 67 | 1 | |
| 56 | 1 | |
| 55 | 1 | |
| 40 | 1 | |
| 39 | 1 | |
| 37 | 2 | |
| 35 | 1 | |
| 34 | 2 | |
| 32 | 2 |
| Distinct | 58 |
|---|---|
| Distinct (%) | 58.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.97 |
| Minimum | 6 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 15.95 |
| Q1 | 33.75 |
| median | 48 |
| Q3 | 66 |
| 95-th percentile | 86.1 |
| Maximum | 93 |
| Range | 87 |
| Interquartile range (IQR) | 32.25 |
Descriptive statistics
| Standard deviation | 21.7378574 |
|---|---|
| Coefficient of variation (CV) | 0.4350181589 |
| Kurtosis | -0.7642314509 |
| Mean | 49.97 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 0.1094557968 |
| Sum | 4997 |
| Variance | 472.5344444 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 45 | 5 | 5.0% |
| 33 | 3 | 3.0% |
| 42 | 3 | 3.0% |
| 64 | 3 | 3.0% |
| 43 | 3 | 3.0% |
| 22 | 3 | 3.0% |
| 49 | 3 | 3.0% |
| 75 | 3 | 3.0% |
| 73 | 3 | 3.0% |
| 24 | 2 | 2.0% |
| Other values (48) | 69 |
| Value | Count | Frequency (%) |
| 6 | 1 | 1.0% |
| 12 | 1 | 1.0% |
| 13 | 1 | 1.0% |
| 14 | 1 | 1.0% |
| 15 | 1 | 1.0% |
| 16 | 1 | 1.0% |
| 17 | 2 | |
| 18 | 1 | 1.0% |
| 20 | 2 | |
| 22 | 3 |
| Value | Count | Frequency (%) |
| 93 | 2 | |
| 91 | 1 | 1.0% |
| 90 | 1 | 1.0% |
| 88 | 1 | 1.0% |
| 86 | 2 | |
| 85 | 1 | 1.0% |
| 84 | 2 | |
| 80 | 1 | 1.0% |
| 79 | 1 | 1.0% |
| 75 | 3 |
| Distinct | 67 |
|---|---|
| Distinct (%) | 67.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 214.53 |
| Minimum | 119 |
|---|---|
| Maximum | 354 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 119 |
|---|---|
| 5-th percentile | 171.75 |
| Q1 | 190.5 |
| median | 210 |
| Q3 | 234.25 |
| 95-th percentile | 270 |
| Maximum | 354 |
| Range | 235 |
| Interquartile range (IQR) | 43.75 |
Descriptive statistics
| Standard deviation | 35.93497354 |
|---|---|
| Coefficient of variation (CV) | 0.1675055868 |
| Kurtosis | 2.344361334 |
| Mean | 214.53 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.8176253281 |
| Sum | 21453 |
| Variance | 1291.322323 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 207 | 3 | 3.0% |
| 234 | 3 | 3.0% |
| 242 | 3 | 3.0% |
| 230 | 3 | 3.0% |
| 174 | 3 | 3.0% |
| 177 | 3 | 3.0% |
| 209 | 3 | 3.0% |
| 223 | 2 | 2.0% |
| 196 | 2 | 2.0% |
| 259 | 2 | 2.0% |
| Other values (57) | 73 |
| Value | Count | Frequency (%) |
| 119 | 1 | 1.0% |
| 141 | 1 | 1.0% |
| 157 | 1 | 1.0% |
| 158 | 1 | 1.0% |
| 167 | 1 | 1.0% |
| 172 | 1 | 1.0% |
| 173 | 1 | 1.0% |
| 174 | 3 | |
| 177 | 3 | |
| 178 | 1 | 1.0% |
| Value | Count | Frequency (%) |
| 354 | 1 | |
| 321 | 1 | |
| 313 | 1 | |
| 282 | 1 | |
| 270 | 2 | |
| 263 | 1 | |
| 259 | 2 | |
| 258 | 1 | |
| 257 | 1 | |
| 253 | 2 |
| Distinct | 50 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.95 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 8 |
| Zeros (%) | 8.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4 |
| median | 13 |
| Q3 | 41.5 |
| 95-th percentile | 75.45 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 37.5 |
Descriptive statistics
| Standard deviation | 26.2787601 |
|---|---|
| Coefficient of variation (CV) | 1.053256918 |
| Kurtosis | 0.03092218018 |
| Mean | 24.95 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 1.049044333 |
| Sum | 2495 |
| Variance | 690.5732323 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 8 | 8.0% |
| 1 | 8 | 8.0% |
| 3 | 5 | 5.0% |
| 5 | 5 | 5.0% |
| 11 | 4 | 4.0% |
| 7 | 4 | 4.0% |
| 19 | 3 | 3.0% |
| 59 | 3 | 3.0% |
| 8 | 3 | 3.0% |
| 2 | 3 | 3.0% |
| Other values (40) | 54 |
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 1 | 8 | |
| 2 | 3 | 3.0% |
| 3 | 5 | |
| 4 | 2 | 2.0% |
| 5 | 5 | |
| 6 | 2 | 2.0% |
| 7 | 4 | |
| 8 | 3 | 3.0% |
| 9 | 2 | 2.0% |
| Value | Count | Frequency (%) |
| 98 | 1 | |
| 93 | 1 | |
| 92 | 1 | |
| 84 | 2 | |
| 75 | 1 | |
| 70 | 1 | |
| 69 | 1 | |
| 64 | 1 | |
| 63 | 1 | |
| 62 | 1 |
| Distinct | 31 |
|---|---|
| Distinct (%) | 31.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.93 |
| Minimum | 2 |
|---|---|
| Maximum | 46 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 11 |
| 95-th percentile | 32.05 |
| Maximum | 46 |
| Range | 44 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 9.424077266 |
|---|---|
| Coefficient of variation (CV) | 0.9490510842 |
| Kurtosis | 3.805475159 |
| Mean | 9.93 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | 2.031738877 |
| Sum | 993 |
| Variance | 88.81323232 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=31)
| Value | Count | Frequency (%) |
| 4 | 15 | |
| 3 | 15 | |
| 5 | 15 | |
| 7 | 8 | 8.0% |
| 6 | 7 | 7.0% |
| 8 | 5 | 5.0% |
| 10 | 5 | 5.0% |
| 11 | 4 | 4.0% |
| 13 | 3 | 3.0% |
| 14 | 2 | 2.0% |
| Other values (21) | 21 |
| Value | Count | Frequency (%) |
| 2 | 1 | 1.0% |
| 3 | 15 | |
| 4 | 15 | |
| 5 | 15 | |
| 6 | 7 | |
| 7 | 8 | |
| 8 | 5 | 5.0% |
| 9 | 1 | 1.0% |
| 10 | 5 | 5.0% |
| 11 | 4 | 4.0% |
| Value | Count | Frequency (%) |
| 46 | 1 | |
| 44 | 1 | |
| 38 | 1 | |
| 34 | 1 | |
| 33 | 1 | |
| 32 | 1 | |
| 29 | 1 | |
| 28 | 1 | |
| 27 | 1 | |
| 25 | 1 |
| Distinct | 22 |
|---|---|
| Distinct (%) | 22.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.67 |
| Minimum | 53 |
|---|---|
| Maximum | 91 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 928.0 B |
Quantile statistics
| Minimum | 53 |
|---|---|
| 5-th percentile | 66.95 |
| Q1 | 79 |
| median | 81 |
| Q3 | 83 |
| 95-th percentile | 86 |
| Maximum | 91 |
| Range | 38 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 5.905065451 |
|---|---|
| Coefficient of variation (CV) | 0.07411905926 |
| Kurtosis | 5.969788398 |
| Mean | 79.67 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -2.040313074 |
| Sum | 7967 |
| Variance | 34.86979798 |
| Monotonicity | Decreasing |
Histogram with fixed size bins (bins=22)
| Value | Count | Frequency (%) |
| 80 | 16 | |
| 82 | 13 | |
| 81 | 13 | |
| 84 | 10 | |
| 83 | 8 | |
| 79 | 8 | |
| 86 | 5 | 5.0% |
| 76 | 3 | 3.0% |
| 72 | 3 | 3.0% |
| 66 | 3 | 3.0% |
| Other values (12) | 18 |
| Value | Count | Frequency (%) |
| 53 | 1 | 1.0% |
| 56 | 1 | 1.0% |
| 66 | 3 | |
| 67 | 1 | 1.0% |
| 70 | 2 | |
| 71 | 1 | 1.0% |
| 72 | 3 | |
| 74 | 1 | 1.0% |
| 75 | 1 | 1.0% |
| 76 | 3 |
| Value | Count | Frequency (%) |
| 91 | 1 | 1.0% |
| 88 | 2 | 2.0% |
| 86 | 5 | 5.0% |
| 85 | 2 | 2.0% |
| 84 | 10 | |
| 83 | 8 | |
| 82 | 13 | |
| 81 | 13 | |
| 80 | 16 | |
| 79 | 8 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| title | artist | top genre | year | beats.per.minute | energy | danceability | loudness.dB | liveness | valance | length | acousticness | speechiness | popularity | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Blinding Lights | The Weeknd | canadian contemporary r&b | 2020 | 171 | 73 | 51 | -6 | 9 | 33 | 200 | 0 | 6 | 91 |
| 1 | Watermelon Sugar | Harry Styles | pop | 2019 | 95 | 82 | 55 | -4 | 34 | 56 | 174 | 12 | 5 | 88 |
| 2 | Mood (feat. iann dior) | 24kGoldn | cali rap | 2021 | 91 | 72 | 70 | -4 | 32 | 73 | 141 | 17 | 4 | 88 |
| 3 | Someone You Loved | Lewis Capaldi | pop | 2019 | 110 | 41 | 50 | -6 | 11 | 45 | 182 | 75 | 3 | 86 |
| 4 | Perfect | Ed Sheeran | pop | 2017 | 95 | 45 | 60 | -6 | 11 | 17 | 263 | 16 | 2 | 86 |
| 5 | Believer | Imagine Dragons | modern rock | 2017 | 125 | 78 | 78 | -4 | 8 | 67 | 204 | 6 | 13 | 86 |
| 6 | lovely (with Khalid) | Billie Eilish | electropop | 2018 | 115 | 30 | 35 | -10 | 10 | 12 | 200 | 93 | 3 | 86 |
| 7 | Circles | Post Malone | dfw rap | 2019 | 120 | 76 | 70 | -3 | 9 | 55 | 215 | 19 | 4 | 86 |
| 8 | Shape of You | Ed Sheeran | pop | 2017 | 96 | 65 | 83 | -3 | 9 | 93 | 234 | 58 | 8 | 85 |
| 9 | Memories | Maroon 5 | pop | 2021 | 91 | 33 | 78 | -7 | 8 | 60 | 189 | 84 | 6 | 85 |
Last rows
| title | artist | top genre | year | beats.per.minute | energy | danceability | loudness.dB | liveness | valance | length | acousticness | speechiness | popularity | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 90 | CAN'T STOP THE FEELING! (from DreamWorks Animation's "TROLLS") | Justin Timberlake | dance pop | 2016 | 113 | 83 | 67 | -6 | 10 | 70 | 238 | 1 | 7 | 72 |
| 91 | Lean On | Major Lazer | dance pop | 2015 | 98 | 81 | 72 | -3 | 56 | 27 | 177 | 0 | 6 | 71 |
| 92 | Despacito - Remix | Luis Fonsi | latin | 2019 | 178 | 80 | 65 | -4 | 7 | 86 | 230 | 23 | 18 | 70 |
| 93 | Lose Yourself | Eminem | detroit hip hop | 2014 | 171 | 74 | 69 | -5 | 37 | 6 | 321 | 1 | 27 | 70 |
| 94 | Without Me (with Juice WRLD) | Halsey | dance pop | 2019 | 136 | 51 | 74 | -6 | 18 | 45 | 229 | 36 | 7 | 67 |
| 95 | One Dance | Drake | canadian hip hop | 2016 | 104 | 61 | 79 | -6 | 32 | 43 | 174 | 1 | 6 | 66 |
| 96 | Sugar | Maroon 5 | pop | 2015 | 120 | 79 | 75 | -7 | 9 | 88 | 235 | 6 | 3 | 66 |
| 97 | Emotions | Mark Mendy | pop dance | 2021 | 126 | 83 | 66 | -5 | 40 | 74 | 172 | 5 | 29 | 66 |
| 98 | Cold Water | Major Lazer | dance pop | 2018 | 93 | 80 | 61 | -5 | 16 | 50 | 185 | 7 | 4 | 56 |
| 99 | I Took A Pill In Ibiza - Seeb Remix | Mike Posner | dance pop | 2016 | 102 | 73 | 67 | -7 | 9 | 66 | 198 | 3 | 10 | 53 |